Combining Mllr Adaptation and Feature Extraction for Robust Speech Recognition in Reverberant Environments
نویسندگان
چکیده
This paper presents an investigation on speech recognition performance in reverberant environments. Reverberant noise has been a major concern in speech recognition systems. Many speech recognition systems, even with state-of-art features, fail to respond to reverberant effects and the recognition rate deteriorates. This shows the limitations of robust feature extraction in reverberant environment. The maximum likelihood linear regression (MLLR) adaptation scheme is adopted for reverberant speech recognition on the TI-DIGIT database. The use of adaptation data improved the recognition performance significantly especially for strong reverberations. The performance of both MFCC 0 and MFCC 0 D A features improved by more than 10% for reverberations greater than 0.4s. This paper also demonstrates the optimal strength of both robust feature extraction and adaptation scheme for reverberant speech recognition. The recognition performance is maintained above 90% up to reverberation time 0.5s using both schemes.
منابع مشابه
Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کاملSpeaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کاملRobust Asr in Reverberant Environments Using Temporal Cepstrum Smoothing for Speech Enhancement and an Amplitude Modulation Filterbank for Feature Extraction
This paper presents techniques aiming at improving automatic speech recognition (ASR) in single channel scenarios in the context of the REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge. System improvements range from speech enhancement over robust feature extraction to model adaptation and word-based integration of multiple classifiers. The selective temporal cepstrum ...
متن کاملFast Adaptation for Robust Speech Recognition in Reverberant Environments
We present a fast method, i.e. requiring little data, for adapting a hybrid Hidden Markov Model / Multi Layer Perceptron speech recognizer to reverberant environments. Adaptation is performed by a linear transformation of the acoustic feature space. A dimensionality reduction technique similar to the eigenvoice approach is also investigated. A pool of adaptation transformations are estimated a ...
متن کاملNoise adaptation for robust AURORA 2 noisy digit recognition using statistical data mapping
The mismatch between system training and operating conditions often has negative influences on automatic speech recognition (ASR) systems. Noise in the operating environments is commonly encountered. ASR model adaptation is an important way to enhance the system performance in noisy environments. This paper proposes a feature-based statistical data mapping (SDM) approach for robust noisy digit ...
متن کامل